Remerge release/0.2 to main by tastelikefeet · Pull Request #145 · modelscope/twinkle

tastelikefeet · 2026-04-09T09:13:46Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Write the detail information belongs to this PR.

Experiment results

Paste your experiment result here(if needed).

gemini-code-assist

Code Review

This pull request primarily focuses on updating the project's dependencies, versioning, and migrating various code examples and documentation to use the Qwen3_5Template instead of the generic Template. It also refactors Dockerfile package installations, integrates Megatron installations directly, and clarifies LoRA weight synchronization behavior in vLLM. Review comments highlight redundant package installations in the Dockerfile, a full-width character in English documentation, a typo in Chinese documentation, and a less deterministic version constraint for the datasets package.

gemini-code-assist · 2026-04-09T09:21:14Z

Dockerfile

+RUN pip install numpy==2.2 --no-cache-dir
+
+# Install tinker, ray, and other deps
 RUN pip install --no-cache-dir tinker==0.14.0 "ray[serve]" transformers peft accelerate -U


The packages transformers, peft, and accelerate are already installed on line 22. Installing them again here is redundant and increases the Docker image size and build time. You should remove them from this command.

RUN pip install --no-cache-dir tinker==0.14.0 "ray[serve]" -U

gemini-code-assist · 2026-04-09T09:21:14Z

docs/source_en/Components/Template/Template.md

+
+Currently, the model-template mapping is simple:
+
+- Template class：Supported in all pure text LLMs.


The colon used here (：) is a full-width character. In English documentation, a standard colon (:) should be used for consistency with the rest of the file (e.g., line 59).

Suggested change

- Template class：Supported in all pure text LLMs.

- Template class: Supported in all pure text LLMs.

gemini-code-assist · 2026-04-09T09:21:15Z

docs/source_zh/组件/检查点引擎/CheckpointEngine.md

 - **HCCLCheckpointEngine**: 适用于昇腾 NPU 环境

 > 检查点引擎是 RLHF 训练基础设施的关键组件,确保训练器和采样器使用一致的模型权重。
+> 目前的同步分为merge_and_sync=True/False两种情况，为True时将lora合并仅基模并同步，为False时仅同步lora权重。另外，多租户直接附加lora文件到vLLM上，在merge_and_sync=False，或使用多租户时，


Typo: 合并仅 should be 合并进 (merged into).

Suggested change

> 目前的同步分为merge_and_sync=True/False两种情况，为True时将lora合并仅基模并同步，为False时仅同步lora权重。另外，多租户直接附加lora文件到vLLM上，在merge_and_sync=False，或使用多租户时，

> 目前的同步分为merge_and_sync=True/False两种情况，为True时将lora合并进基模并同步，为False时仅同步lora权重。另外，多租户直接附加lora文件到vLLM上，在merge_and_sync=False，或使用多租户时，

gemini-code-assist · 2026-04-09T09:21:15Z

pyproject.toml

 dependencies = [
  "numpy>=2.0.0,<2.3.0",
-  "datasets>=3.0,<4.0",
+  "datasets",


Removing the version constraint for datasets (previously 3.0,<4.0) makes the installation less deterministic and prone to breaking if a new major version of datasets is released. It is recommended to keep a version range constraint.

Suggested change

"datasets",

"datasets>=3.0,<4.0",

tastelikefeet and others added 19 commits April 5, 2026 20:09

bump version to 0.2.0

9072c90

fix

967093a

fix

68aa05b

fix dockerfile

1eeb423

remove oss2

b4c5fbf

fix install

fe6e866

fix server config

b2d1466

fix trust_remote_code

1c20c5e

fix install script

4c7cdfa

fix support models

4086a54

fix torchrun

b55ba2c

Merge branch 'main' into release/0.2

230f89a

Merge commit '8c3dbc90a70235eb4d2409e5a6c6f5f2dff7fd8a' into release/0.2

050e9b5

fix version

6558796

add no-cache-dir

031be8f

fix deps version

80e9e58

fix deps version

e3f37c9

update dockerfile

9342d74

fix docs

d1c2d8e

tastelikefeet closed this Apr 9, 2026

gemini-code-assist bot reviewed Apr 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remerge release/0.2 to main#145

Remerge release/0.2 to main#145
tastelikefeet wants to merge 19 commits intomainfrom
release/0.2

tastelikefeet commented Apr 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 9, 2026

Uh oh!

gemini-code-assist bot Apr 9, 2026

Uh oh!

gemini-code-assist bot Apr 9, 2026

Uh oh!

gemini-code-assist bot Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		Currently, the model-template mapping is simple:

		- Template class：Supported in all pure text LLMs.

	- Template class：Supported in all pure text LLMs.
	- Template class: Supported in all pure text LLMs.

	> 目前的同步分为merge_and_sync=True/False两种情况，为True时将lora合并仅基模并同步，为False时仅同步lora权重。另外，多租户直接附加lora文件到vLLM上，在merge_and_sync=False，或使用多租户时，
	> 目前的同步分为merge_and_sync=True/False两种情况，为True时将lora合并进基模并同步，为False时仅同步lora权重。另外，多租户直接附加lora文件到vLLM上，在merge_and_sync=False，或使用多租户时，

Conversation

tastelikefeet commented Apr 9, 2026

PR type

PR information

Experiment results

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant